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Abstract (250 words) 

The fusion peptides (FP) play an essential role in fusion of viral envelope with cellular 
membranes. The location and properties of the FPs in the spike (S) glycoproteins of different 
coronaviruses (CoV) have not yet been determined. Through amino acid sequence analysis of S 
proteins of representative CoVs, we identified a common region as a possible FP (pFP) that 
shares the characteristics of FPs of Class-I viral fusion proteins including high Ala/Gly content, 
intermediate hydrophobicity, few charged residues. To test the hypothesis that this region 
contains the CoV FP, we systemically mutated every residue in the pFP of Middle East 
Respiratory Syndrome betacoronavirus (MERS-CoV), and found that 11 of the 22 residues in the 
pFP (from G953 to L964, except for A956) were essential for S protein-mediated cell-cell fusion 
and virus entry. The synthetic MERS-CoV pFP core peptide (95,IAGVGWTAGLo¢a4) induced 
extensive fusion of liposome membranes, while mutant peptide failed to induce any lipid mixing. 
We also selectively mutated residues in pFPs of two other B-CoVs, Severe Acute Respiratory 
Syndrome Coronavirus (SARS-CoV) and Mouse Hepatitis Virus (MHV). Although the amino 
acid sequences of these two pFPs differed significantly from that of MERS-CoV and each other, 
most of the pFP mutants of SARS-CoV and MHV also failed to mediate membrane fusion, 
suggesting that these pFPs are also the functional FPs. Thus, the FPs of 3 different lineages of B- 
CoVs are conserved in location within the S glycoproteins and in their functions, although their 
amino acid sequences have diverged significantly during CoV evolution. 

Importance (150 words) 

Within the Class-I viral fusion proteins of many enveloped viruses, the FP is the critical mediator 
of fusion of the viral envelope with host cell membranes leading to virus infection. FPs from 


within a virus family, like influenza viruses or human immunodeficiency viruses (HIV), tend to 
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share high amino acid sequence identity. In this study, we determined the location and amino 
acid sequences of the FPs of S glycoproteins of 3 B-CoVs: MERS-CoV, SARS-CoV, and MHV, 
and demonstrated that they were essential for mediating cell-cell fusion and virus entry. 
Interestingly, in marked contrast to the FPs of influenza and HIV, the primary amino acid 
sequences of the FPs of B-CoVs in 3 different lineages differed significantly. Thus, during 
evolution the FPs of B-CoVs have diverged significantly in their primary sequences, while 
maintaining the same essential biological functions. Our findings identify a potential new target 


for development of drugs against CoVs. 
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Introduction. 

Viruses are obligate intracellular parasites, and host cell membranes act as a barrier to 
virus entry. Enveloped viruses initiate infection of cells through fusion of the viral and cellular 
membranes. CoVs are enveloped and single stranded plus sense RNA viruses that cause a variety 
of diseases among many different species (1). Phylogenetically, CoVs are divided into four 
genera: alphacoronavirus (&-CoV), betacoronavirus (B-CoV), gammacoronavirus (y-CoV), and 
deltacoronavirus (5-CoV) (2). 

CoVs enter cells through the interactions of the viral S proteins with host receptors. 
Several cellular proteins have been identified as receptors for their respective CoVs. Specific 
examples include human angiotensin converting-enzyme 2 (hACE2) for SARS-CoV and human 
CoV NL63 (3, 4), human dipeptidyl peptidase IV (hDPP4) for MERS-CoV (5), bat DPP4 for bat 
CoV HKU4 (6), human aminopeptidase N (hAPN) for human CoV 229E (7), mouse 
carcinoembryonic antigen-related cell adhesion molecule 1a (mCEACAM 1a) for MHV (8). 

The CoV S protein is a Class-I viral fusion proteins. On the CoV virions, the 180-200 
kDa S proteins are found as trimers. S monomers contain two subunits called S1 and 82. S1 
contains the receptor binding domain (RBD) and is responsible for receptor recognition and 
binding, whereas S2 possesses the membrane fusion machinery (9, 10), including a fusion 
peptide (FP), two heptad repeat domains (called the N-terminal and C-terminal heptad repeats, 
HR-N and HR-C), the juxtamembrane domain (JMD) and a transmembrane domain (TMD) (Fig 

1A). 

To mediate membrane fusion, S protein must be activated, which requires both 
proteolytic cleavage (priming) and receptor binding with or without pH change (triggering) (11- 


13). Several host priming proteases are important for S protein mediated CoV entry, including 
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cathepsin B and L, serine protease TMPRSS2 and 4, trypsin, elastase, HAT, and furin (14-20). S 
protein activation leads to a series of conformational changes and insertion of a putative FP into 
target membrane, an essential step in membrane fusion and virus infection. Class-I viral fusion 
proteins generally contain one FP, located either internally, like the FPs of the glycoprotein (Gp) 
of Ebola virus and the envelope protein (Env) of avian sarcoma leukosis virus (ASLV) (21-24), 
or immediately down stream of the “priming” site, as seen in the hemagglutinin (HA) of 
influenza and the Env protein of HIV (25, 26). Although the primary sequences and lengths of 
FPs vary significantly among different Class-I viral fusion proteins, they share several common 
features. Most are rich in Ala and/or Gly, have an intermediate level of hydrophobicity with 
membrane binding potential, form helical structures in the presence of trifluoroethanol (TFE), 
and contain very few charged resides in the middle of their sequences (13, 25, 27). 

Although significant efforts have been made to locate the FPs of different CoVs (28-35), 
the exact locations and sequences of CoV FPs remains controversial. While Chambers et al 
predicted that the CoV FP was likely adjacent to HR-N (ref), Manu et al proposed that the 
sequence immediately following a critical and conserved trypsin cleavage site at the arginine of 
position 797 (R797) of SARS-CoV S protein, SFIEDLLFNK VILADAGF, may be the FP of 
SARS-CoV S protein (32). In this study, we used bioinformatics to identify a 17-22 amino acids 
long region, just upstream of HR-N, in S2 of different CoVs with characteristic features of the 
FPs of other Class-I viral fusion proteins. Using mutational, biochemical, and biophysical 
analyses of this region of the S proteins of 3 B-CoVs, MERS-CoV, SARS-CoV, and MHV, we 
provide data to support this region as the functional FP of CoV S proteins. 


Materials and Methods 
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Cell culture. HEK-293, 293T, HEK-293 cells stably expressing hACE2 (293/hACE2), HeLa 
cells stably expressing hDPP4 (HeLa/hDPP4), and HeLa cells stably expressing mouse 
CEACAM1a (HeLa/mCEACAM 1a) were maintained in Dulbecco’s modified Eagle’s medium 
(DMEM) (Invitrogen, Carlsbad, CA) supplemented with 10% fetal bovine serum (FBS) and 2% 
penicillin-stereptomycin-fungizone (Invitrogen) at 37 °C with 5% CO. 

Constructs and mutagenesis. The constructs, ppDNA-SARS-CoV SA19 (36), pceDNA-MERS- 
CoV SA16 (37), and peDNA-MHV S (38) have been described previously. Briefly, DNA 
encoding codon-optimized SARS-CoV S protein lacking the last 19aa, or MERS-CoV S protein 
lacking last 16aa but with a FLAG tag at the C-terminus, or full length MHV S protein was 
cloned between BamH I and Not I sites of pecDNA3.1. All SARS-CoV, MERS-CoV, and MHV S 
mutants were derived from the plasmid pcDNA-SARS-CoV SA19, pcDNA-MERS-CoV SA16, 
and pcDNA-MHYV §, respectively. All mutagenesis was carried out using Q5 mutagenesis kit 
(NEB, MA, USA). After the entire coding sequences were verified by sequencing, the BamH I 
and Not I containing mutated S gene was cloned back into pcDNA3.1. A plasmid encoding full- 
length hACE2 (pACE2-cq) was kindly provided by M. Farzan (Scripps Research Institute, 
Florida campus). A plasmids encoding full-length human DPP4 (pcDNA-hDPP4) was purchased 
from Sino Biological Inc (Beijing, China). A plasmid encoding full-length mouse CEACAM 1a 
(mCEACAM 1a) has been described previously (39). To express soluble human ACE2 (shACE2) 
and soluble human DPP4 (shDPP4), DNA fragments encoding residues 19-615 of human 
hACE2 with N-terminal 6his and FLAG tags and residues 40-766 of human DPP4 with C- 
terminal 6his and AVI tags were cloned between Sal I and Hind III and between BamH I and 
Xho I of modified pFASTBac1 vector with gp67 signal peptide, respectively. To express soluble 


mouse CEACAM 1a (smCEACAM 1a), residues 1-236 of mCEACAM 1a with C-terminal 6his 


(0) 
= 
6 
TO 
(0) 
— 
(72) 
O 
jae 
— 
(OL 
= 
O 
72) 
=) 
(= 
= 
TO 
v 
(ok 
(0) 
O 
O 
~ 


Journal of Virology 


Journal of Virology 


119 


120 


121 


122 


123 


124 


125 


126 


127 


128 


129 


130 


131 


132 


133 


134 


135 


136 


137 


138 


139 


140 


141 


and AVI tags were cloned into EcoR I and Not I of pFASTBac1. These soluble receptors were 
expressed in High Five insect cells using the bac-to-bac system (Invitrogen) and purified using 
nickel affinity and ion-exchange chromatography. 

Analysis of S protein expression on cell surface. Briefly, HEK-293T cells were transfected 
with 2 ug of either wild-type or mutant S protein-expressing plasmid using polyethyleneimine 
(PEI) (Polyscience Inc, Warrington, PA, USA). Forty hours later, cells were detached from 
plates by incubating with PBS+1mM EDTA for 5min at 37°C. After washing, cells were 
incubated with the respective primary anti-S antibody for | hour on ice. The primary antibodies 
for SARS-CoV SA19, MERS-CoV SA16, and MHV S protein were rabbit polyclonal anti-SARS 
S1 antibody (1:300 dilution) (Sinobiological Inc, Beijing, China), mouse monoclonal anti-MERS 
S antibody (1:300 dilution) (Sinobiological Inc, Beijing, China), and goat polyclonal anti-MHV 
S antibody (AO4) (1:200 dilution), respectively. After washing, cells were stained with Alexa 
Fluor 488 conjugated goat anti-rabbit IgG (1:200) (ZSGB-Bio LLC, Beijing, China) for SARS S, 
or goat anti-mouse IgG (1:200) (ZSGB-Bio LLC, Beijing, China) for MERS S, or rabbit anti- 
goat IgG (1:200) (ZSGB-Bio LLC, Beijing, China) for MHV S. After washing, cells were fixed 
with 1% paraformaldehyde and analyzed by flow cytometry. 

Binding of soluble receptor. HEK-293T cells were transfected with plasmids encoding either 
wild-type or mutant S proteins with PEI. After 40 hours, cells were lifted with PBS+1mM EDTA 
and immediately washed twice with PBS+2% normal donkey serum (NDS). About 2x10° cells 
were incubated with 1 ug of shACE2, or shDPP4, or smCEACAM 1a for | hour on ice. After 
washing, cells were incubated with mouse monoclonal anti-FLAG M2 antibody (1:1,000 dilution) 
(Sigma, St Louis, MO, USA) for shACE2 and followed with Alexa Fluor 488 conjugated goat 


anti-mouse IgG (1:200), or rabbit polyclonal anti-A VI antibody (1:200 dilution) (Shanghai 
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Enzyme-linked Biotechnology Co., Shanghai, China) for shDPP4 and smnCEACAM la, and 
followed with Alexa Fluor 488 conjugated goat anti-rabbit IgG (1:200). Cells were fixed with 
1% paraformaldehyde and analyzed by flow cytometry. 

Production and transduction of S protein-pseudotyped lentiviruses. Pseudovirions with 
spike proteins were produced as described previously (40) with minor modifications. Briefly, 
plasmids encoding either wild-type or mutant S proteins were co-transfected into 293T cells with 
pLenti-Luc-GFP (a gift from Dr. Fang Li, Duke University) and psPAX2 (Addgene, Cambridge, 
MA) at a molar ratio of 1:1:1 by using PEI. The following day, the cells were fed with fresh 
medium. After 24 hrs incubation, the supernatant media containing pseudovirions were 
centrifuged at 800g for 5min to remove debris, and passed through a 0.45-um filter. To quantify 
S protein-mediated entry of pseudovirions, susceptible cells were seeded at about 25-30% 
confluency in 24-well plates. The following day, cells were inoculated with 500ul of 1:1 diluted 
viruses. At 40 hours post-inoculation (PI), cells were lysed at room temperature with 120ul of 
media with an equal volume of Steady-glo (Promega, Madison, WI). Transduction efficiency 
was monitored by quantitation of luciferase activity using Modulus II Microplate Reader (Turner 
Biosystem, Sunnyvale, CA). All experiments were done in triplicate and repeated at least three 
times. 

Detection of viral spike glycoproteins by western blot. To evaluate S protein expression in 
cells, HEK 293T cells were transfected with plasmids encoding either wild-type or mutant S 
proteins by using PEI. Forty hours later, cells were lysed with lysis buffer (50 mM Tris-HCl 
pH7.4, 150 mM NaCl, 2 mM EDTA, 1% Triton X-100, 0.1% SDS) containing protease 
inhibitors (Roche, USA). To determine S protein incorporation into pseudotype virions, the 


virion-containing supernatant was pelleted through a 20% sucrose cushion at 30,000 rpm at 4°C 
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for 2 h in a Beckman SW41 rotor (40). Viral pellets were resuspended into PBS. Cell lysates and 
pseudovirion pellets were separated on a 4-15% SDS-PAGE and transferred to a nitrocellulose 
blot. The SARS-CoV SA19, MERS-CoV SA16, and MHV S proteins were detected with 
polyclonal rabbit anti-SARS S1 antibodies (1:2,000), monoclonal mouse anti-MERS S antibody 
(1:1,000), and polyclonal goat anti-MHV S antibody (1:2,000), respectively, and the blots were 
further stained with horseradish peroxidase conjugated antibodies, respectively: goat anti-rabbit 
IgG (1:10,000), goat anti-mouse IgG (1:10,000), and rabbit anti-goat IgG (1:10,000), and 
visualized with Clarity Western ECL substrate (Bio-Rad, Hercules, CA, USA). The B-actin and 
HIV capsid protein (p24) were detected using mouse monoclonal anti-B-actin antibody (1:5,000) 
(Sigma, St Louis, MO, USA) and rabbit polyclonal anti-p24 antibody (1:5,000) (Sinobiological 
Inc, Beijing, China), respectively. 

Cell-cell fusion assays. Cell-cell fusion assays were performed as previously described (37) with 
modifications. Briefly, 293T cells were co-transfected with plasmids encoding CoV S 
glycoprotein and GFP. Forty hours later, cells were detached with trypsin (0.25%) and overlaid 
on a 70% confluent monolayer of 293/hACE2, or HeLa/hDPP4, or HeLa/nCEACAM1a cells at 
a ratio of approximate one S-expressing cell to two receptor-expressing cells. After overnight 
incubation, images of syncytia were captured with a Nikon TE2000 epifluorescence microscope 
running MetaMorph software (Molecular Devices). To quantify S protein mediated cell-cell 
fusion, 293T cells were co-transfected with pFR-Luc, which contains a synthetic promoter with 
five tandem repeats of the yeast GAL4 binding sites that controls expression of the luciferase 
gene, and plasmid encoding S protein, and the receptor-expressing cells (293/hACE2, 
HeLa/hDPP4, or HeLa/mCEACAM 1a) were transfected with pBD-NF«B, which encodes a 


fusion protein with DNA binding domain of GAL4 and transcription activation domain of NF«B. 
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The following day, S expressing 293T cells were lifted with trypsin and overlaid onto receptor 
expressing cells at a ratio of about one S-expressing cell to two receptor-expressing cells. When 
cell-cell fusion occurred, luciferase expression would be activated through binding of the GAL4- 
NF«B fusion protein to GAL4 binding sites at the promoter of the luciferase gene. After 24 hrs 
incubation, cells were lysed by adding 120ul of medium with an equal volume of Steady-glo, and 
luciferase activity was measured with a Modulus II Microplate Reader. All experiments were 
done in triplicate and repeated at least three times. 

Peptide synthesis. All peptides were synthesized using a standard solid-phase FMOC (9- 
fluorenylmethoxy carbonyl) method by Scilight Biotechnology LLC (Shanghai, China). 
Purification was carried out by reversed-phase high-performance liquid chromatography (HPLC), 
and verified by mass spectrometry. An Ahx-KKK linker was added to all peptides used in 
circular dichroism (CD) spectroscopy analysis to increase peptide solubility in PBS. Peptides for 
CD analysis include: CTRL: KWGQYTNSPFLTKGF-Ahx-KKK, a control peptide from a 
previous SARS study (33); HIV FP (41): AVGIGALFLGFLGAAG-Ahx-KKK; and MERS pFP: 
SSLLGSIAGVGWTAGLSSFAAI-Ahx-KKK. Peptides for lipid mixing study include: CTRL: 
KWGQYTNSPFLTKGF; HIV FP: AVGIGALFLGFLGAAG; MERS short FP (sFP): 
IAGVGWTAGL; MERS mutant FP (mFP): IAGRGRTAGL. 

CD spectroscopy. CD spectroscopy analysis was performed to study the secondary structure of 
fusion peptides in increasing trifluoroethanol (TFE) concentrations. CD spectra were acquired on 
a Jasco J-815 spectropolarimeter (Jasco, Tokyo, Japan) using a 1-nm bandwidth with a l-nm step 
resolution from 195 to 260 nm at room temperature. Spectra were corrected by subtraction of its 
respective solvent. The sample spectrum was smoothed with a Savitsky-Golay filter. The a- 


helical content was estimated from the ellipticity value at 222nm, [0]222, according to the 


211 — empirical equation of Chen et al (42): “helical content=100* ([8]222/-395000x(1-2.57/n)) , 
212 where nis the number of peptide bonds. 

213 Preparation of liposomes. Equimolar amounts of egg phosphatidylethanolamine (PE), egg 
214  phosphatidylcholine (PC), and cholesterol (Avanti Polar Lipids, Alabaster, Ala., USA) were 


215 dried from chloroform into a thin film by constant flow of nitrogen gas, and rehydrated in Tris 
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216 buffer (10 mM Tris, 150 mM NaCl, 0.1 mM EDTA, pH7.2) at a concentration of 10 mM. Large 
217 unilamellar vesicles (LUV) were prepared by the extrusion procedure (43). Briefly, after ten 

218 freeze-thaw cycles, liposomes were extruded 21 times through two stacked polycarbonate 

219 membranes with a pore size of 0.1 um using an Avanti mini-extruder. Liposome with 0.6% 

220 (molar ratio) fluorescent resonance energy transfer (FRET) pairs Rho-PE and NBD-PE (Thermo 
221 Fisher) were prepared in the same way. 

222 Lipid mixing. Lipid mixing was determined using the resonance energy transfer assay, described 
223 by Struck et al (44) with minor modifications. Briefly, Rho-PE and NBD-PE labeled liposomes 


224 were mixed with unlabeled liposomes at a ratio of 1:9. The final lipid concentration was 300 uM. 


Journal of Virology 


225 Specified amounts of various peptides were added to initiate fusion, and changes in fluorescence 
226 — were monitored at 535 nm with the excitation wavelength set at 465 nm and a slit width of 4 nm 
227 using Fluromax-4 (Horiba, Paris, France). The initial residual fluorescence of the labeled and 
228 unlabeled vesicles was set up as baseline for 0% fluorescence value (fo); 100% fluorescence 

229 value (fio) was achieved by addition of Triton X-100 to final concentration of 0.2%. The extent 
230 of lipid mixing was calculated using the following formula: %F =(fi-fo)/(fioo-fo)* 100, where f, is 
231 the fluorescence value observed after addition of fusion peptide at time t. 


232 Results 


Journal of Virology 
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During membrane fusion, the FP of S proteins inserts into the host membranes. We 
reasoned that CoV FPs might share some common properties with the transmembrane domains 
(TMD) and that the location of the FP within the S protein might be predictable by using TMD 
prediction software programs. The FPs of HIV-1 Env and influenza HA have been studied 
extensively and their locations and amino acid sequences are known. As a proof of concept, we 
first tested whether TM software programs could accurately identify the FPs of HIV-1Env and 
influenza H1N1HA proteins. Both the FPs and TMDs of HIV-1 Env and influenza HA were 
accurately identified by two software programs, TMpred 


(http://www.ch.embnet.org/software/TMPRED_form.html) and TMHMM 


(http://www.cbs.dtu.dk/services/TMHMM,/) (Data not shown). Subsequently, we applied these 
two software programs to analyze S proteins of a wide variety of CoVs. The positions of the 
TMDs of the S proteins of all CoVs studied were correctly identified by both software programs 
(Fig 1B). In addition, both of these TMD prediction programs identified another region 
consistently flanked by YT at the N-terminus and PF at the C-terminus in all of the S proteins of 
the CoVs tested (Figs. 1B and 1C). Although the primary amino acid sequences of this region 
were not conserved in all of the CoVs studied, they were all Ala or Gly rich, relatively 
hydrophobic, and contained no charged residues, characteristics shared by the FPs of other 
Class-I viral fusion proteins (Fig. 1C). We named this region in CoV S proteins the possible FP 
(pFP). 

To investigate if the pFP is the functional fusion peptide of CoVs, we selected the S 
protein of MERS-CoV, a lineage C B-CoV, as an example. The MERS-CoV pFP contains amino 
acids 949 to 970 (Fig. 1C). Individual and occasionally double amino acid substitutions were 


introduced at each position of pFP (Fig. 1D). First, we determined if any of the mutations altered 


256 the expression of S protein in 293T cells. Consistent to our previous report (37), two bands 

257 around 200 kDa were detected in the cell lysate of 293T cells expressing wild-type (WT) S 

258 protein, likely reflecting the different glycosylation of full length S proteins during transport 

259 through the Golgi apparatus. However, the cell lysate also contained a significant proportion of S 


260 protein cleaved between S1 and 82, around 100 kDa, which was absent in our previous report, 
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261 but previously reported by the Pohlmann laboratory (45). The difference between this study and 
262 our early report likely resulted from different culture conditions, especially sera and media from 
263 different vendors. Among the total 44 G, A, V, or R substitutions, 30 (S949G, S950G, L952A, 
264 G953A, G953R, S954G, S954R, 1955G, A956V, A956R, G957A, G957R, V958G, V958R, 

265 1955G/V958G, G959A, G959R, W960G, W960R, V958G/W960G, T961A, A962V, A962R, 

266 G963A, G963R, L964G, L964R, S965G, S966G, and A968V) showed no or minor effects on S 
267 protein expression or processing when compared to WT(Fig. 2A and Table 1). On the contrast, 
268 14 substitutions (L951G, L952G, L951G/L952G, S965R, S966R, F967G, L964F/F967G, A968R, 


269 A969V, AI69R, I970G, P971V, F972G, and I970G/F972G) showed significant reductions in S 
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270 protein expression and changes in patterns of S protein processing (Fig. 2A and Table 1). The 
271 cleaved S protein species were almost absent in corresponding cell lysates, suggesting that these 
272 residues (L951, L952, S965, S966, F967, A968, A969, 1970, P971, and F972) may be important 
273 for S protein folding and processing. 

274 We then investigated if any amino acid substitutions in the pFP influenced transport of 
275 _ the S protein to the cell surface. The 293T cells expressing WT or mutant S proteins were 

276 incubated on ice with mouse monoclonal anti- MERS-CoV S protein antibody and analyzed by 
277 flow cytometry. The same 30 mutants that showed WT levels of S protein expression in cell 


278 lysates also showed WT levels of S protein on the cell surface (Fig. 2B and Table 1). As 
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expected, the mutants with defects in S protein expression and processing also showed only low 
levels of S proteins on the cell surface. 

Although the pFP is located within the MERS-CoV S2 subunit, amino acid substitutions 
in pFP might affect S protein binding to its cognate receptor, hDPP4, by altering the overall 
conformation of the S protein. To determine whether or not any amino acid substitution in pFP 
changed S protein binding to hDPP4, we used V5-tagged soluble hDPP4 (shDPP4) to bind 293T 
cells transiently expressing WT or pFP mutant S proteins of MERS-CoV. The percentage of cells 
that bound shDPP4 and the level of shDPP4 bound to S protein were quantitated by flow 
cytometry. The same 30 mutant S proteins that showed WT levels of expression on cell surface 
also bound to shDPP4 at levels similar to WT S protein (Fig. 3 and Table 1), indicating that these 
pFP mutations had no effect on receptor binding. 

Because the fusion peptide is essential for S protein-mediated membrane fusion, we then 
explored whether any mutation in pFP altered MERS-CoV S protein-mediated cell-cell fusion. 
To more easily visualize cell-cell fusion or syncytia, the 293T cells expressing S protein were co- 
transfected with a GFP-expressing plasmid, then overlaid with HeLa/hDPP4 cells in the presence 
of trypsin. Consistent with our previous report (37), WIT MERS-CoV S protein induced very 
large syncytia (Fig 4) and syncytia formation depended on the availability of hDPP4 (data not 
shown). Among 30 pFP S protein mutants that were expressed well, transported to the cell 
surface efficiently, and bound to hDPP4 at levels similar to WT, 14 mutants (S949G, S950G, 
G953A, $954G, A956V, A956R, G957A, V958G, GIS59A, A962V, GI63A, L964G, S965G, and 
A968V) induced large syncytia in HeLa/hDPP4 cells similar to WT, while 12 mutants (G953R, 

S954R, 1955G, G957R, V958R, 1955G/V958G, W960R, V958G/W960G, T961G, A962R, 


G963R, and L964R) induced little or no syncytia formation, and 4 mutants (L952A, G959R, 
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W960G, and S966G) induced syncytia of much smaller size than WT (Fig 4). These results 
indicate that these 13 residues, L952, G953, $954, 1955, G957, V958, G959, W960, T961, A962, 
G963, L964, and S966, in MERS-CoV S protein are critical for S protein-mediated, receptor- 
dependent membrane fusion that would lead to virus infection. 

To quantify the effect of amino acid substitutions on S protein-mediated syncytia 
formation, we utilized a luciferase-based quantification assay from a yeast two hybrid system 
from Stratagene-Agilent Technologies, Inc. Compared to mock transfection and parental HeLa 
cell controls, fusion of 293T cells expressing WT MERS-CoV S proteins with HeLa/hDPP4 cells 
increased luciferase activity by about 1,000-fold (Fig.5). The overall pattern of cell-cell fusion 
induced by pFP mutants in this quantification assay was very similar to our visual method (Fig. 4 
and 5, Table 1). Among the same 30 mutants showing WT level of expression and receptor 
binding, 16 mutants (S949G, S950G, L952A, G953A, S954G, A956V, A956R, G957A, V958G, 
G959A, A962V, G963A, L964G, S965G, $966G, and A968V) retained 50-110% of WT level 
fusion activity, but 14 mutants (G953R, S964R, 1955G, G957R, V958R, 1955G/V958G, G9S59R, 
W960G, W960R, V958G/W960G, T961G, A962R, G963R, and L964R) reduced S protein- 
mediated cell-cell fusion by more than 85%, indicating that these residues (G953, S954, I955, 
G957, V958, G959, W960, T961, A962, G963, and L964) are essential for membrane fusion. 

To determine whether or not any mutation in the pFP of the S protein of MERS-CoV also 
affected virus entry, we measured transduction of HeLa/hDPP4 cells by lentiviral pseudovirions 
with envelopes containing either WT or pFP mutant MERS-CoV S proteins. Compared to mock 
control (pseudovirions without any S protein), the luciferase activity in HeLa/hDPP4 cells 
increased by more than 10,000 fold following transduction by pseudovirions with WT MERS- 


CoV S proteins (Fig. 6A). Among the same 30 mutants that showed little or no effects on S 
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protein expression or receptor binding (Figs 2A, 2B, 3, and Table 1), 5 mutants (L952A, G953A, 
G953R, G963R, and S966G) showed marked reduction in S protein incorporation into 
pseudovirions, whereas the S proteins of the other 25 mutants were incorporated into 
pseudovirions as well as WT S protein (Fig. 6B). Ten out of these 25 amino acid substitutions, 
S954R, 1955G, G957R, V958R, 1955G/V958G, W960R, V958G/W960G, T961G, A962R, or 
L964R, almost abolished MERS-CoV S protein-mediated, receptor-dependent pseudovirion 
entry (Fig 6A and Table 1), suggesting that $954, 1955, G957, V958, W960, T961, A962, and 
L964 are essential for virus entry. In addition, G959R mutation also reduced the transduction by 
more than 95%, indicating that G959 may also be critical for virus entry too (Fig. 6A). 
Interestingly, although G953A, G953R, and G963R mutants showed reduced but similar levels 
of S protein incorporation into pseudovirions (Fig. 6B), the infectivity of the pseudovirions 
differed drastically. While G953A result in only 30% of WT level of pseudovirion entry, the 
G953R and G963R mutations almost abrogated S protein mediated pseudovirion entry, 
indicating that G953 and G963 may also be important for virus entry. 

Because the FPs of most Class-I viral fusion proteins fold predominantly in an o-helix 
structure in the presence of TFE (13), we used circular dichroism spectroscopy (CD) analysis to 
investigate whether our MERS-CoV pFP also adapts an a-helical fold. A scrambled peptide 
from a previous SARS-CoV study (33) was chosen as the negative control, and the FP of HIV-1 
was selected as the positive control (46). To facilitate the synthesis of the peptides and increase 
their solubility, an aminocaproic acid (Ahx) linker followed by 3 Lys residues (Ahx-KKK) was 
added to the C-termini of the peptides. Consistent with the previous reports (46), while the FP of 
HIV-1 folded as a random coil in Tris/salt buffer, it formed an o-helix in the presence of 


trifluoroethanol (TFE) (Fig. 7A), a solvent known to stabilize the o-helical formation (47). 
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Similarly, in the absence of TFE, the pFP of MERS-CoV (SSLLGSIAGVGWTAGLSSFAAIT) 
folded as a random coil, but with the addition of TFE, it folded as an o-helix. At 95% of TFE, 
helical population accounted for more than 64% (Fig. 7A). 

FPs of Class-I viral fusion proteins also promote membrane fusion when mixed with 
liposomes. Accordingly, we investigated whether the pFP of MERS-CoV S protein could 
mediate liposome fusion using a FRET-based assay. To rule out any possible effect of the Ahx- 
KKK tag, we decided to use peptides without any tag. However, because of the technical 
difficulty of synthesizing the full length pFP without the AHX-KKK tag, we decided to use 
instead the core sequence of pFP (9sIAGVGWTAGLo¢a, called “short pFP” or sFP) in this study, 
in which almost all of the residues were shown to be essential for cell-cell fusion and virus entry. 
As shown in Fig 7B, both the FP of HIV-1 and the sFP of MERS-CoV induced membrane fusion 
of liposome in a concentration dependent manner, whereas the negative control peptide did not 
induce any significant lipid mixing. Moreover, when we replaced V958 and W960, two residues 
essential for cell-cell fusion and virus entry, with Arg in the MERS-CoV sFP peptide, the 
resulting mutant FP (mFP) (95sIAGRGRTAGLo¢4) failed to induce any noticeable lipid mixing, 
confirming that these two residues are essential for lipid mixing. 

Having established the essential roles in membrane fusion and virus entry of the pFP of 
the S protein MERS-CoV, a B-CoV in group C, we also investigated the functional role of the 
pFPs of other CoVs. After examining the alignment of the pFPs of different CoVs (Fig. 1B), we 
selected the pFPs of the S proteins of SARS-CoV, a lineage B B-CoV, and MHV, a lineage A B- 
CoV, for functional study. While the pFP of SARS-CoV shares the same length and has about 
1/3 of amino acid sequence identity with the pFP of MERS-CoV, the pFP of MHV differs 


markedly from that of MERS-CoV in both length and amino acid sequence. Since hydrophobic 
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residues in the pFP of MERS-CoV play important roles in membrane fusion, we selected W868, 
F870, L876 and 1878 of SARS-CoV S protein and M936, F937, P938, P939, and W940 of MHV 
S protein for further analysis. Single Arg and/or Gly substitutions were introduced into the MHV 
and SARS-CoV S proteins at these positions. 

With the exception of I878-related mutants, the pFP mutant S proteins of SARS-CoV 
were expressed well (data not shown), bound well to its receptor, hACE2, at levels similar to WT 
(data not shown), and were incorporated into pseudovirions efficiently (Fig 8B). 1878 mutants 
(1878G, I878R, and double mutant L876G/I878G) were expressed slightly less well in cell 
lysates (data not shown) and showed reduced S protein incorporation into pseudovirons (Fig. 8B), 
indicating that I878 may play a role in folding and transport of S protein. Similar to MERS-CoV 
S protein, all Arg mutations in pFP of SARS-CoV effectively abolished S protein mediated cell- 
cell fusion and virus entry (Fig. 8A, 8C, and Table 1), suggesting that these residues are indeed 
essential for membrane fusion. Compared to Arg mutations, Gly substitutions in the pFP of 
SARS S protein had less effect on cell-cell fusion and virus entry. Interestingly, although the 
single mutants, W868G and F870G, showed almost WT level infection, the double mutant 
W868G/F870G abolished S protein mediated virus entry (Fig. 8A), confirming that these two 
residues in S protein of SARS-CoV are important for membrane fusion. 

All MHV S protein pFP with single Arg substitutions (M936R, F937R, P938R, P939R, 
and W940R) showed significant reduction in both S-mediated pseudovirion entry (Fig. 8D and 
Table 1) and cell-cell fusion (Fig. 8F and Table 1). S proteins with M936R substitutions, 
however, showed significantly decreased expression of S protein in cell lysate (data not shown) 
and incorporation into pseudovirions (Fig. 8E). This may partly explain why M936R mutations 


had detrimental effects on virus infection and cell-cell fusion. P938R substitution also showed 
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slight reduction in expression and virion incorporation of S protein. In contrast, S proteins with 
F937R, P939R, and W940R substitutions had wild-type levels of S protein expression (data not 
shown) and incorporation into virions (Fig. 8E), and binding to its cognate receptor (data not 
shown), mCEACAM 1a, but failed to mediate virus entry or syncytia formation. These data 
indicate that F937, P939, and W940 in the pFP may be essential for MHV S protein-mediated 
membrane fusion. 

Discussion. 

Proteolytic priming is one of the early essential steps required to activate the fusion 
potential of Class-I viral fusion proteins, and is believed to release the restrain on the viral FP 
leading to exposure of the FP. The proteolytic priming sites for most of the Class-I viral fusion 
proteins are either immediately proximal to or not far upstream of the viral FP (21-26). Therefore, 
identifying the key proteolytic priming site may lead to discovery of a viral FP. However, in the 
case of CoVs, the priming sites are less clear. In an attempt to identify the trypsin cleavage site 
essential for MERS-CoV S protein mediated trypsin-dependent entry, we mutated several trypsin 
sites (R884G/R887G, K897G, R921G, and K933G) upstream of the N-terminus of HR-N of 
MERS-CoV S protein (48, 49). Surprisingly, we found that none of these sites was essential for 
trypsin-primed MERS-CoV S protein-mediated virus entry (Data not shown). Therefore, there 
might be built-in redundancy of trypsin priming sites within the MERS-CoV S protein such that 
cleavage by trypsin might occur at multiple sites and single cleavage at any one of these sites 
might be sufficient to prime the MERS-CoV S protein. 

Since there was not a single essential trypsin priming site for the S protein of MERS-CoV, 
we used an alternative approach to look for the FP of MERS-CoV S protein. Using TMpred and 


TMHMM software programs to analyze the S2 domains of a variety of CoVs, we identified a 
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region in 82 that is flanked by YT at the N-terminus and PF at the C-terminus and found in all 
the CoVs studied (Figs 1B and 1C). This pFP region has characteristics of the known FPs of 
other Class-I viral fusion proteins, Gly or Ala rich, relatively hydrophobic, and without charged 
residues. This pFP region is located at about 7-23 amino acids upstream of the N-terminus of 
HR-N of CoV S proteins, depending on where the N-terminus of HR-N was proposed (48-53). 
Mutagenesis analysis on the pFPs of MERS-CoV, SARS-CoV, and MHV S proteins revealed 
that this region was essential for S protein mediated syncytia formation and virus entry (Table 1), 
and strongly support the idea that the pFP of B-CoV S protein is the functional viral fusion 
peptide. This conclusion is further strengthened by our findings that the synthetic pFP of MERS- 
CoV S protein formed an o-helix in the presence of TFE and its core short sequence, called sFP, 
mediated membrane fusion of liposome efficiently (Fig 7), which are characteristics of FPs of 
other Class-I viral fusion proteins (13). Our results are also consistent with previous biophysical 
studies on synthetic peptides from SARS-CoV S protein (29, 33) and previous studies in MHV 
showing that P939 may be critical for membrane fusion and virus infection (54, 55). 

About one third of the residues located at the C-terminus of the pFP of MERS-CoV S 
protein appear to play important roles in the stability and processing of the S protein, since 
introduction of amino acid substitutions into these positions significantly reduced S protein 
expression, processing and incorporation into pseudotyped virions. Residues close to the C- 
terminus of the pFP of the SARS-CoV S protein also appear to be important for S protein folding, 
as replacement of I878 with R or G also decreased S protein expression and incorporation into 
virions. However, this region might also be important for membrane fusion mediated by S 
protein. A recent study on SARS-CoV by Liao et al (56) raised the possibility that this region 


might make direct interactions with the JMD in S protein during membrane fusion. 
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Among the amino acid substitutions that we introduced into the pFP of MERS-CoV S 
protein, Arg had a more profound effect on the function of the pFP of MERS-CoV S protein than 
Gly, Ala, or Val. Compared to Gly, Ala, and Val, Arg is positively charged and its side chain is 
significant longer than Gly, Ala, or Val, hence Arg substitution represents a more dramatic 
change than these amino acids. Moreover, Arg substitution may cause a higher free energy 
barrier for insertion of FP into membrane (57). Although the exact mechanism(s) of how these 
substitutions in the pFP abrogate membrane fusion requires further investigation, there are 
several possibilities. Introduction of mutation(s) into the pFP of MERS-CoV S protein might 
distort the structure of FP required for membrane fusion similar to G1V and W14A mutations of 
the FP of influenza HA (58-60). Alternatively, the substitutions might change how the FP inserts 
into membranes (61-63), or affect the oligomerization of the FPs that is important for membrane 
fusion (64, 65). 

Recent studies in influenza HA (66), paramyxovirus F protein (67), and HIV Env (68) 
reveal that many viral FPs interact and oligomerize with their TMDs in the lipid, which promotes 
lipid mixing and membrane fusion. Whether the FP and TMD of CoV S protein interact with 
each other during membrane fusion remains to be further determined. Interestingly, the primary 
amino acid sequences of the TMDs among different CoVs also do not share high identity (Fig 9). 
Of note, there is a GXXXG or (small)XXX(small) motif (G, Gly; small, Ala or Gly or Ser; X, 
any residue) present in all of the pFPs of CoVs. These motifs were initially discovered in human 
glycophorin A and have subsequently been implicated in TMD interactions of more than 20 
proteins (69). Recent studies in influenza HA and HIV Env have suggested that such GKXXG 
motifs may also play an important role in FP:FP or FP:TMD interaction (66, 68, 70). There are 


two GXXXG motifs, GSIAG and GWTAG, within the FP of MERS-CoV. Replacement in 
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MERS-CoV S protein of any one of these four Gly residues (G953, G957, G959, or G963) with 
Arg abrogated the membrane fusion activity of the viral protein. However, whether these 
GXXXG motifs in the pPF of MERS S protein are essential for oligomerization or interaction 
with the TMD requires further investigation. 

FPs of some Class-I viral fusion proteins, like HIV Env and influenza HA, share high 
identity in primary amino acid sequence within each virus family. In marked contrast, this study 
found no strong amino acid sequence identity among the pFPs of MERS-CoV, SARS-CoV, and 
MHV. The lengths of the FPs of these three different lineages of B-CoVs also differ significantly, 
ranging from 18 for MHV to 22 amino acids for MERS-CoV and SARS-CoV (Fig. 1B). Within 
each lineage of B-CoVs, the pFPs appear to be better conserved (Fig 1B). Although underline 
mechanism(s) causing the amino acid sequence diverge of FPs of different lineages of B-CoVs 
remains to be determined, CoV RNA-dependent RNA polymerase error, recombination, and 
selective pressure during evolution likely contribute to these changes. Previous study of MHV 
persistent infection in DBT cells showed that accumulation of mutations in fusion peptide and 
HR-N could lead to extending host range (55). The lack of conservation of the pFP amino acid 
sequences, however, is not unique for CoVs, as FPs from different paramyxoviruses also lack 
high identity in their primary amino acid sequences (67). 

As an internal fusion peptide, how does the activated FP of CoVs fold and mediate 
membrane fusion? Recent studies have demonstrated that FPs from different Class-I viral fusion 
proteins might adapt different conformations to mediate membrane fusion. Depending on the 
lipid composition, the FPs of HIV-1 Envs and PIV F proteins can fold as either o—helix (67, 71) 
or B-sheet (65, 72), and both can be fusiogenic. In contrast, the overall conformation of the FPs 


of Ebola Gp and influenza HA is o-helical in the presence of TFE, but they fold as hairpin-like 
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structure or “knuckle” conformations when they insert into their target membranes (63, 73). 
Sequence analysis of the S proteins of different CoVs (Fig 1B) shows the presence in the pFPs of 
a Gly-Gly (GG) motif in a-, y-, and 6-CoVs or a Pro-Pro (PP) motif in B-CoVs in lineage A. As 
GG and PP motifs favor the formation of turn or hairpin structures, this observation suggests that 
the FPs of some CoVs might also adapt a hairpin-like structure when inserting into host 
membranes. In the FPs of SARS-CoV and MERS-CoV in B-CoV groups b and c respectively, 
however, neither a GG nor a PP motif is present. Of note, the FP from group 2 influenza HA also 
lacks a central GG or PP motif, but instead it forms a hairpin-like structure with G13 at the turn 
with a Trp and a hydrophilic residue immediately following G13 (74). Interestingly, a similar 
motif is also present in the pFPs of SARS-CoV and MERS-CoV (Fig 1B). 

While all known Class-III viral fusion protein have two fusion loops, all known Class-I 
viral fusion proteins except for CoV S protein only have a single fusion peptide. In the case of 
CoVs, in addition to pFP found in this study, Manu et al previously found a highly conserved 
region in SARS-CoV S protein essential for membrane fusion and proposed it as the possible 
fusion peptide (32), although this sequence lacks some common features of FPs of other Class-I 
viral fusion proteins, including high Ala/Gly content. Their proposed FP is about 80 amino acids 
away from the N-terminus of HR-N (50, 52) and about 40 amino acids upstream of the N- 
terminus of our pFP. The possibility of presence of two possible fusion peptides in the S protein 
of CoV is very intriguing. How these two possible fusion peptides collaborate to mediate 
membrane fusion requires further investigation. 

In summary, using a bioinformatics approach we have identified a region in the S 
proteins of CoVs that has several properties the FPs of several classical Class-I viral fusion 


proteins. Further molecular biological, biochemical, and biophysical analyses demonstrated that 
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this region is essential for receptor-dependent membrane fusion mediated by S proteins of 
several B-CoVs in different lineages, strongly suggesting that it is the functional FP of these and 
likely other CoVs. These findings will provide significant clues for future studies of the 
membrane fusion mechanism of CoVs and may provide a new target for drugs against CoV 
infections. 
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Figure legend. 

Figure 1. pFPs of CoVs. (A) Diagram of CoV spike protein. NTD, N-terminal domain; C- 
domain, C-terminal domain; Cleavage site, protease cleavage site separating S1 and S2; pFP, 
possible fusion peptide; HR-N, N-terminal heptad repeat; HR-C, C-terminal heptad repeat; 
JMD, Juxtamembrane domain; TMD, transmembrane domain. (B) Locations of pFPs and TMDs 
of S proteins of representative CoVs predicted by TMPred. (C) Amino acid sequence alignment 
of the pFPs of different CoVs. (D) Summary of the amino acid substitutions made in the pFP of 
MERS-CoV S protein. 

Figure 2. Analysis of expression of pFP mutants of MERS-CoV S protein in 293T cells. (A) 
Western blot analysis of expression of WT or mutant MERS S protein in cell lysate. The MERS 


S protein was detected by using mouse monoclonal anti-MERS S antibody; B-actin was detected 
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with mouse monoclonal anti-actin antibody. (B) Analysis of surface expression of mutant 
MERS-CoV S protein by flow cytometry. MERS-CoV S protein expressing 293T cells were 
stained with mouse monoclonal anti-MERS S antibody. The amount of wild-type S protein on 
cell surface was set as 100%. All of the experiments shown were repeated at least three times. 
Figure 3. Receptor binding by mutant MERS S proteins. MERS-CoV S protein expressing 293T 
cells were incubated with soluble AVI-tagged hDPP4, followed with polyclonal rabbit anti-A VI 
antibody and FITC conjugated goat anti-rabbit IgG. The results from wild-type were set as 100%. 
Figure 4. Cell-cell fusion mediated by WT or mutant MERS-CoV S protein. MERS-CoV S 
protein expressing 293T cells were transiently transfected with eGFP, then incubated with 
HeLa/hDPP4 cells for overnight in the presence of trypsin. 

Figure 5. Quantitative analysis of syncytia formation mediated by WT or mutant MERS-CoV S 
protein. Cell-cell fusion was quantified by measurement of luciferase activities. Typically, the 
relative luciferase activities from cell-cell fusion induced by wild-type S protein were over 10’, 
while the reading for mock control was less than 1000. The experiments were done at least three 
times. 

Figure 6. Entry of pseudotype virions with wild-type or mutant MERS S protein. A. Entry of 
pseudovirions with wild-type or mutant MERS-CoV S proteins into HeLa/hDPP4 cells. 
Pseudovirus entry was quantitated by luciferase activity at 40 hrs post inoculation. A typical 
transduction by wild-type S protein pseudoviruses resulted in increase of over 10,000-fold of 
luciferase activity. The experiments were repeated at least three times and an average of three 
experiments is shown. B. Detection of wild-type or mutant S protein incorporation into 
pseudovirions by western blot analysis. MERS S protein was detected using mouse monoclonal 


anti-MERS S antibody; p24, a gag protein of HIV, was detected using rabbit polyclonal anti-p24 
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antibodies. FL S: full length S protein. The experiments were repeated twice and a representative 
is shown. 

Figure 7. Biophysical analysis of synthetic pFP peptide of MERS-CoV. A. CD analysis of 
secondary structure of pFP of MERS-CoV S protein. CTRL: KWGQY TNSPFLTKGF-Ahx- 
KKK, a control peptide from previous SARS-CoV peptide study (33); HIV FP: 
AVGIGALFLGFLGAAG-Ahx-KKK; MERS pFP: SSLLGSIAGVGWTAGLSSFAAI-Ahx- 
KKK. All peptides were dissolved in PBS, and their CD spectrum was measured in the presence 
of indicated concentration of TFE. Experiments were done twice and one representative is shown. 
B. Lipid mixing induced by synthetic pFP of MERS-CoV S protein. LUVs were made with 
equal moles of PE, PC, and cholesterol. The extent of lipid mixing was determined by 
monitoring the changes in fluorescence intensity at 535 nm at 37°C upon addition of peptide. 
Each data point is averaged from three independent experiments, and error bars represent 
standard deviations of the means. CTRL: KWGQYTNSPFLTKGF; HIV FP: 
AVGIGALFLGFLGAAG; MERS sFP: IAGVGWTAGL; MERS mFP: IAGRGRTAGL. 

Figure 8. Effects of mutations at the pFPs of SARS-CoV and MHV on pseudovirus transduction 
and cell-cell fusion. A, D. Entry of wild-type or mutant SARS-CoV S protein pseudovirions into 
293/hACE2 cells (A) or MHV S protein pseudovirions into HeLa/nCEACAM 1a cells (D). 
Pseudovirus entry was quantitated by luciferase activity at 40 hrs post inoculation. The 
experiments were repeated at least three times and average of three experiments is shown. B, E. 
Detection of wild-type or mutant S protein of SARS-CoV (B) or MHV (E) incorporation into 
pseudovirions by western blot analysis. SARS S protein was detected using rabbit polyclonal 
anti-SARS S1 antibody; MHV S protein was detected using goat polyclonal anti-MHV S 


antibody AO4; p24, a gag protein of HIV, was detected using rabbit polyclonal anti-p24 
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antibodies. The experiments were repeated at least three times and one representative is shown. 
C, F. Cell-cell fusion mediated by mutant SARS (C) or MHV (F) S proteins. Experiments were 
performed as in Fig. 3B, except that 293/hACE2 cells were used as targets for SARS-CoV S 
protein (C) and HeLa/nCEACAM 1a cells were used as targets for MHV S protein (F). An 
average of three experiments is shown. 


Figure 9. Alignment of TMDs of S proteins of representative CoVs. 
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Cell-cell fusion and pseudovirion transduction: +++, >70% of WT; ++, 31-70% of WT; 

+, 5-30% of WT; -, <5% of WT. ND, not done. 


>= 

(o>) 
me} 
i) 
= 
> 
sg 
fo) 
oT 
= 
= 
= 
fe) 
© 


S protein 
SI $2 
Cleavage site f1, sin’ “ 
B 
229E NL63 FCOV TGEV 


“il 5 i| = Pile ti 
"i yi N Al Hh Ni wi wi i %4| we AW my i | 
z \ z 


| "] 


MHV | OC43 | HKU1 | SARS | 
BE ie ota eT 
| Lt ry | 

iy Awa wil i} Mi i | AN Alt AYN NN : 
ahi | 


=> 
—_ 
= 
a 
= 
= 
= 


D 
= 
C 
=e) 
O 
= 
2) 
O 
jae 
(o. 
ts 
3) 
2) 
=) 
Cc 
= 
=e) 
.D 
a 
D 
O 
9) 
< 


MERS HKU9 | IBV | BULBUL | 


: a: mt: i: | 


\, } ly | 
Nal Wail Ny ii yh | ~ PA A Nhe) 
AM wll Waa | i Wile ‘Why ' Mi, = ‘wi iN i W Vy 


{re { ™o 


Group 


769 | alpha 

950 | alpha 

1056] alpha 

39 10391 alpha 
s 948 | betaA 
op) 992 | betaA 
2 982 | betaA 
g 979 | betaA 
> 7880 betaB 
are) MERS 947 ;SIAGVGWTAGESS-F. 9972 | betac 
are} HKU4 946 SSIAGAGWTAGLSS-F. 971 | betac 
3 HKUS 870 GAVASSGYTFGITS- 3895 betaD 

2 IBV 757 ASMAFG-~--GIEA 778 gamma 
4 Bulbul 737 758 delta 


MESS S&S SILL GSE AGC VGWTRAGUS S FAAT PF 


S89G G - 

MISO ay Was: ie cad cosas We gate amen cm Sey iy Se wt a aot Se ges Oe ey 

WIG - - G----+--+-+-+-+--+ -++-+-++-++-- 

WOGhe wove Qk mao Sue UMAR OK Hs 

LOS2A - - - A- - . ia | fe - - - - - . - - . - - - 
LOSIGL9S2G - - GG-+-+-+- - 

QE = sc st Ea Ss LF : 

ons Sis eR Se SS ee 8 

ROG: S) as. at Glew ve ex # & . 

SHR = - - = - Re - : a ee : 

WSG - - = + +--+ G- -- - - 

AaysevV - - - - - - - ¥ 

ASB SH a woe Se Res = - 


GIA - = 2 ee eee A ee = te om 
GiGlce: = <e: = te we wm Raza wer 8B ee we 


V98G - + - +e ee Gio s . 
Mos =) SS SHER SS Si Res = 
DSGNEG os a we «IG & G 


GIN = 2 ee ee ee te A ee ee ee 
GIR - - - = ee ee ee Re ee ee ee ee 


W960G - G : 

WOR - = - - = + 2 = ee =e R : 
V9S8GIWI0G - - - - - - = = + G+ Ges - - - 

THIG = = © os te ee em epee Gio os ° 

AW2V- - © © ee ee ee ee ee 

AMR - = = = - se 5 ee eee eRe 

ORCA ac Ge) Bel me ey lg es! a te eS OS A 


GOR = se ee se es eee we Rw Be ee 
COBAG in 2 ke, 1 1, 1 ce fe ey oe mw te 8 Ee es ow is 


7) i a a a | Se 
SSG Be Cte a ee Sm. HE GS 
SSR - = = + 2 ee ee eee ee ee Re 
SH6EG - - - = = © © eee ee eee ee Ge 
SHER - se ee eee ee ee ete ee Re 
FMIG - - 5 6 ee ee ee te ee ee ee Oe 


IMGUPSMIG si 2 Ce CRUSE RRS Se HIE Es 
AMBR - = = = ee ee ee eee ee ee 
AMV = swe ee eee ee ee 
AMOR - = - ss ee ee ee ee ee ee 
ONGis: = ew ey Re eR ew OF ee a ee 
SETI er, x eh fe Doh es Ne SE el) ee hee fe “ete ond ee oe 
PG - - = = = 2 ee et te tet ete 

1970GIFIT2G - 


Journal of Virology 


92264/90Z61 
92264 


FL.S 
| Cleaved S 


yOs6M 
SO096M 
y6S69D 
V6S69 
Oss6NV/OSSE! 
Yss6A 
O8S6A 
uyZS69 
VLS69 
yos6V 
A9S6V 
SSsé6l 


92263/9026! 
92264 
AltZ6d 

S026! 

Y696V 

A696V 

ysg6v 

A896V 
9965/9961 
92964 

y996S 


Yss6A 
uvses 
Ovs6s 
yes6d 
Vves69 

O7S6V91S67 
VvzS61 
97561 
1S61 
90s6S 
6r6S 

adAL PIM 
4OOW 


9996S 


uyS96S 
OS96S 
urge) 
S961 
Ye96D 
VE96D 
Yz96V 
AZ96V 
SL96L 
SOI6MWO8S6A 


VzS61 


—_- 


—SS———SS—_———_S—T_. 
ate FL S 
Cleaved S 
—_—_— 
LCOCOILLOCOO>XKaCKororeo 
DESH HHBOSOOSGOBOSOS 
AAOADMAMOMAIADAOAHADMPAN9MDH9HDHHDOH a 
OF>S00SSSF I C00400H 
re 9 
w Ye) 
2 fz) 
> 


© 
tT N =) ioe) <e) vT N 
on 


—_ —_— 


(adAL pl JO %) uoIsseidxe soRyiNs |]9a5 


150— 
100— 


1 | 
o Oo 
nm oO 
- 


<x 


SUI|UO Ppojsod Jdiosnupyy paydaooV7 ABojoui, JO JOUsNoF ABoJOIA JO OUINOL 


922645/50Z61 
92264 


S296 V/SDLS61 
Vzs61 
92961 
S-S61 
S0s6s 
O6r6S 

8GAL PIIM 
490 


=] i=] oO f=] [=] o i) 
N Oo foe) © v ™N 
= 


(adAL piiM sO %) Apanoy Bulpuig 10}da094y 


SUI|UO) Ppojsod Jdiosnupyy paydeaoV7 ABojosl, Jo jOwNor ABo|ouIA JO OUINor 


Accepted Manuscript Posted Online 


Journal of Virology 


JVI 


Journal of Virology 


L951G/L9526 


G95oR 


A962V 


L964G/F967G 


IS70G/E972G 


ISS58G/V958G 


V958G6/Ws60G 


92264/90Z61 


A896V 


S096M/D8S6A 
YO96sM 
S096M 

u6S69 

V6S69 
O8S6AN/OSS6l 


9296 V/DLS61 
Vzs61 
92961 
S-S671 
S0s6S 
S6r76S 
SaAl PIIM 
490 

o =) —] =} o r—) 

=) To) t N 


120 
0 


(adAy pjiM Jo %) UOISNY |]99-[]2D 


SUI|UO Ppojsod Jdiosnupyy paydeooV7 ABojoui, JO JOUsNoF Bool, JO OUINOL 


92265/90Z61 


92264 ” 
AtL6d v 
S026! YB 
y696V 2&8 . 
A696V a 3 
sey ca aby 
52964/Dv961 096M zo a 
30968 Y6S69 9764/9026! 
osee v6S69 9764 
eoen Oss6//OSSEI 
57961 ueséA| § § 
use oes | 
ped ylS69 
S961 VlS69 
Q9BEIVOREA sn 
Sosent Ags6v 91964/9 961 
SeseN/OSSEl anie ©1863 
Ea upsées BOIes 
pied YES6O | yS96S 
SS6l VEs69 9596S 
ete OZSEVOLSET ! 4967 
z2S6 V/91S61 92561 YEI6D 
eee S156) veg969 . 
Banna 9056S en 
Saat PUM Sores aan RevEN: 
490 SdAL PIM OL96L 

§ S & 3 <4 4 - yoo OOIBMWOBSEA 

~e (adA1 PIIM JO %) SOMIAHSY asesayION 7] fava 


SUI|UO Ppojsod Jdiosnupyy paydaooV7 ABojouiA JO JOusNor ABojoul, JO |OUINOL 


A —— 0% TFE 
40 rn 25% TFE 
CTRL =e 
—— 95% TFE 


= 
8 


2 


3 
[010 degem dmol ) 
o 


8 


© 
= 
C 
ao) 
D 
= 
2) 
O 
jae 
(o. 
= 
3) 
2) 
=) 
Cc 
= 
ao) 
.D 
om 
D 
O 
9) 
— 


—0% TFE 
25% TFE 
40 —— 50% TFE 


HIV FP —— 75% TFE 


—— 95% TFE 


nN 
c=) 


[e(10 degem dmol ) 
2 o 


rs) 
S 


-40 


——0% TFE 
——— 25% TFE 
40 —— 50% TFE 


» MERS pFP 


—— 95% TFE 


-1 
[e410 deg aan dmol ) 
o 


& 
Ss 


Journal of Virology 
3 


-40 
190 200 210 220 230 240 250 260 270 280 
Wavelength (nM) 

B 
-> CTRL 
—» HIVFP 
~~ MERS sFP 
—* MERS mFP 


Lipid Mixing (%) 


Ratio (peptidellipid) 


Journal of Virology 


SUI|UG pejsod| jdiosnuDy) paydeoaV7 


(LM 40 %) 
Ayaijoe asesayion| 


|oszevo9ze7 
yp 58281 
| uezel 
HE 9287 
+ H9Z81 


90284/9898M 
mm 0010: 


SARS-CoV 


(adKL PLM $0 %) 
salAloy asesayion7 


<x fa'a) 


FLS 


100 
75 
si 
25 


oesgeg gs 
ass 828 


FLS 


+ 
nN 
a 


uoIsny 1M 40 % 


un 
ao} 
o 
> 
& 
zx 
U 


20 


Se 
cy 


8828 ° 
uoIsny 1M 40 % 


P24 
12 
100 


U 


ABojosl, JO JOUINOL 


ABo|o.IA 4O JOUINOL 


(0) 
= 
6 
me) 
(0b) 
a 
Wn 
O 
(ai 
mame: 
(OL 
— 
O 
ny 
=) 
= 
= 
me) 
® 
oj 
os 

(0) 
©) 
©) 
<< 


IBV 
Bulbul 


a ae 


TAGIWCLVLAVIMLM@MANG 
AFATIIFILIMGWLFFMiere 
FLAIAWPACIIVTIFLhyele 


